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© Expression systems for amidating enzyme. 

© Alpha-amidating enzyme is produced by recombinant DNA techniques recoverable in high yields and at high 
purity. Both eukaryotic and prokaryotic expression vectors are provided having a transcriptional promoter 
followed downstream by a DNA sequence which encodes amidating enzyme. The vector selected is one capable 
of directing the expression of polypeptides in the host selected, and preferred hosts are transected with the 
described vectors. 
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EXPRESSION SYSTEMS FOR AMIDATING ENZYME 

BACKGROUND OF THE INVENTION 

This invention relates to the production of a.pha-amidating enzymes through recombmant DMA tech- 
niques. and particularly to expression vectors and hosts capable of express.ng alpha-am.dat.ng enzyme .n 
hiqh yields and at recoverable high purity. 

The intracellular processing (cleavage and/or functional group modification) of precursor forms of nat.ve 
proteins following their translation from nucleic acid coding sequences has been clearly d ^ me ^. 
In general, mammalian cells and other eukaryotes can perform certain posMranslat-ona. process-ng 
, procedures, while prokaryotes cannot. Certain prokaryotes. such as E. coli. are widely employee las hosts 
for the production of mammalian proteins via recombinant DNA (rDNA) technology ^ 
readily grown in batch fermentation procedures and because they are generally well-cha^c te nzed 
However many mammalian proteins require some type of post-translational processing, andjf these 
proteins are produced by genetic engineering of E. coll. for example, the post-translational , 
5 often be accomplished by using complex, in vi^mical procedures which are cost- probative for large- 

SCa OnC^ ^S-** involves the specific amidation of the carboxyl-termina^ I amino acid of a 
peptide or protein. Many naturaHy-occurring hormones and peptides contain such a mod. tication whjch ,s 
often essence, if the protein is to be biologically active. An example is calcitonin where £ 1 « 

o non-amidated proline residue for the amidated proline of the native form results m a 3.000-fold ^uct.on n 
logical activity. Other biotogical peptides requiring post-translationa. amidation for full activity mclude but 
are not limited to growth hormone releasing factor, other calcitonins, and calc.ton.n gene-related ^pe ,ptde. 

The specific amidation of the carboxy.-termina. amino acid of a protein is cataly, 3 d by alpha-a* gating 
enzymes The polypeptide sequences for many important biological proteins wh.ch requ,re am.dat.on for 

?5 efficacy, may be manufactured, for example, by genetic engineering techn,ques^ However, the 

Sent and sometimes essentia, carboxy. terminal amidation must often be » P*^" 2g Jf « 
desirable to avoid costly and cumbersome chemical amidation techniques at this pent. 
desirable to utilize an amidating enzyme to perform the specific amidafon. However. a.pha-am.dat.ng 
enzvme is not easily obtained in nature. i Alftt i e 

30 The presence of amidated peptides in a particular tissue is not necessarily «™^J^£* 
of alpha-amidating enzyme. For example, rat anterior pituitary tissue contains h.gh 
but no known substrates [Eipper et al. PNAS 80. 5144-5148 (1983)]. Rat postenor p. u, ^ ^ commns 
amidated peptides (oxytocin and vasopriisF) but has very little alpha-amidat .ng act. v.ty [E ^ 
116. 2497-2504 (1985)]. Therefore, until individual tissues are tested for alpha-am.dat.ng act.v.ty. the 

35 presence or potential levels of the enzyme cannot be anticipated. 

An even greater impediment to the availability of amidating enzyme obtained from natural sources s 
the usually low level of purity. Amidating enzymes obtainable from natural sources are contaminated wrth 
proteose enzymes and other impurities Effective recovery of amidated product is greatiy hmdered when 
me e impurity-Led enzymes are used to amidate a substrate comprised of L-am.no 

4 0 of proteases, in particular, may break down the substrate and/or the am.dated product and/o ^ h f a ^f a ^ 
enzyme itself. Most biologically important polypeptides comprise L-amino acids, and are ^^*J« 
proteolytic breakdown and to other amidation-hindering impediments caused by .mpunties .n am.dat.ng 

"TecaTnature' provides few sources, low abundance and insufficient purity of alpha-amidating 
« enzyme, there is a need for efficient methods of mass producing alpha-amidating enzyme recoverable in 

"TIThereTrtLs "amidating enzyme- and "alpha-amidating enzyme" refer to any agent 
capable of catalyzing the conversion of a peptidyl substrate to a corresponding peptidyl am.de hav.ng an 
amino group in place of the C-terminal amino acid of said substrate. 
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BRIEF DESCRIPTION OF THE INVENTION 



It is an object of the present invention to provide alpha-amidating enzyme recoverable in high yields 
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^,„«w,. WM «.« CT o«.«e»«T«TTC0.crm» 

tv-;- 4 ™ -rrr;"; r; ty ; .-.,;....«.-->»« 

TeacweeeT S Tcm«e»Tete»c»ecfatoecec» 6 «Tii«tfCM«cc» T e««tT»t 

; ; : ; ; ; t ; . •• - » » •.*%••••*••■ • h 

0 <«c»T..ACTccc.cceAo.KioT.mACTC W ».eoo.coA^ •» 
. « . • . • .- — • J "*! "" ! , , i . i i • I T I lilit t> 

TC«OT«fT»eiTee»*eec««.t.t«»«necxte»t»«etct«e««»»cctcwteM«mon^ »» 

~r v7vrv'r;vrv r; rrrv: -rr, *. . r • a 

5 cccA»meTerc«««cccciCATCcmKAi 6 c.cecccAtcAc.AAeucc«^ ••»» *J 

n-TTTTrnTTt'TlTr. i i » r i ♦•«..•»•• - •. «""o 

20 CT AAC»»,.Acm<KAm,cc»e« e .ecKAcc»t«n.ccc.Ke;..c,™ »" g 

■~- - "i-;-;-;- tttTi ..«*.* » • » « < » • « • » 1 <p 

, e A.oAA,.cAcc.,ccTccTCAmAccc..ATA.T=c,CA»Kctc^ »». ^ 

*:-;"V;tttt v;t ; ;..•..•«»»*>••••• m 

KAc.T«A.AmTT««CACACAmc«acTCCAC«cc,«T«c.;.^ "» 

25 . - - ; ii, k .k.triit>* r * a ,51 " 

IecAAc^«T«.cc»A..«Am«cceA^^^ ™» 

— .„...-..-.«-....--.----.----.■---♦■--••"; . . ».» • •»««•<•»«■ • 
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CTCC*CTCCCAnCTCnTAtA10AACTT*CCCTW*0AC«aciTtXTCUCCM 21SS 
^CnCCTCCATCT**AT*CCCCT«TClCKCTKJACACAOJM »» 
rACCAncTC«WTCTCCCACfCAA0«AAaA ; 2"CCJC*CC*« 3ISS 

H , V ■ . - " .. . 3"* 
stringent conditions with a DNA sequence ol: 
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W c»c»ccAccTcccTccccTTCACA.ccccAcm^ 

_.„........_....-_...-.- .;--; -J-- f , » . » » r . « . . , . I , I . • 

ACCA«T«CCCCTmO»T«C»TC»C.Tm^ * 

c W ecT 6 T e c»K«e«oecTtmemc.CTi«KceTmccc»ee«ce»TACTcrce^ > 

nrrr r.T;~T;Tr. ;j » • i •••••••»• • «« 

«ACTe«»cm CT e«mt«wmuw«»«mK«CATAA» e ««^ * 

— . — ♦ — ^~--,-_-.--"4~--.---.----"-- ~j~ 

ACcVcTOCCATTCACAOTTWACCACAAACTCCAACCAAATACTfCCTCCTICAACftCACTAtCCCTAM » 

wcmoemcteccfAeATmACAcoTotcccceAcccmAAnw^ « 

„...„...-.-..--.__. - r ---.- ; --..- rr » »--;-; I ■* • * 

.ACTACfCAATeCTeACAme0TCCCAAT.CAA.AmAICC.ATCCATCTOm«^ ' 

»tACA 0 A C TA»e»AAeeCACACK«C»CmmCACCCC»eA«Ce^»«:K^ ' 

t TTwrrrri Tvr;~ rri > i r • 

emATATACTCCeACeCACAtCTCTCmACTCCTCAACeCACeACAOAMCCA^ACATC^ 

.„....--,-..-.---.---.----♦--- - , t „ , , , t • . a . . « « i r i 

, TC »KT»m«T0«..ccmnTC«cm«n«T0»c^^ "»» 

cecAAtreeTOtCA.ACc W eAT^.T.»T«eAco W »KACA«c«ce.,A.« »» „ 

-TiTTJ-TTTTrriTrS-l ....».»••»• < « » * « « 
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As used herein, the term "stringent conditions" means 2 x SSC (0.3M sodium chloride and 0.03M 

^T^sent' Mention a.so provides expression vectors for directing expression of a-P^"^ 
enzyme Sp orotic and eukaryotic systems. For example, an expression vector ., » P^j£* • 
capable of directing, in a prokaryotic host, the expression of a po.ypept.de sequence of an a^ha am.dalng 
enzvme said vector comprising a transcriptional promoter followed downstream by a first DNA sequence 
ha ing an amida ng enzyme-coding region, said first sequence being sufficiently homologous to , . na u a 
D^IA sequence fo? expressing natural amidating enzyme to undargo hybnd.zat.on with 
seTuenTunder stringent condLs. and said first sequence including an initiating meth.on.ne codon w,th.n 
about 50 nucleotides of the start of said enzyme-coding region. 

Likewise, an expression vector is provided, in another embod.ment of the .nvention wh«* « <**™ 
directing the expression of a polypeptide sequence of an a.pha-amidating enzyrn ^« ^ 
vector comprising a transcriptional promoter followed downstream by a first DNA "^J™*^ 
amidating enzyme-coding region, said first sequence being sufficiently homologou t a tu a. DNA 
sequence for expressing natural amidating enzyme to undergo hybnd.zat.on with said natural sequence 
^conditions, and said first sequence including a stop codon upstream from a sequence which 
would otherwise code for a membrane spanning domain. 

This first sequence should be followed by a sequence specifying the addition of poly a 
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messenger RNA generated by transcription from said ^prompter determined by 

As used herein, the term "membrane spann-ng doma « s a DNA^ equenc mJ c ^ 
the test o, Kyte l . D** ±M*g V* ^ JJ£^^ hydrophobicity. .ength. 
hereby incorporated by reference), codes tor an ammu «. m examole this may occur as a 

structural character, and the like to become fixed ,n the memb n. For xample t y^ ^ 
protein is synthesized on a membrane-bound "bosome or q { which it is a 

set .orm together «ith base mimbe, rbferances. For 1 r,«LrTbein, M Ktst 

are consecutively numbered with the first being the am.no ac.d expressed by bases 1 3. 

BHIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a flow chart for construction of a mamma.ian expression vector for alpha-amidating 
enZym Rgure 2 is a flow chart for construction of a prokaryotic expression vector for alpha-amidating 

, figure 3 Jessie ^ 

rlSfSfi ^^"dTtot: growth medium <C = inso,ub.e proteins of £«. 

JM1 ° 5 Fia a u7e in 4 9 isTwestern Blot of the ge. shown in Rgure 3 wherein, foilowing protein transfer to nitro 
■ ce,.,ose 9 the *i JTSLd with rabb?t anti AE antisera and the alkaKne phosphatase-con.ugated ant, 
rabbit Ig. followed by chromogenic sibstrate for the alkal.ne phosphatase. 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 
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„ aecotttaoee with an. itweotion. ^tllSC 
»,c,„, s suitable tor eukaryotic systems ate ptepatad ONA ^**0 No 

cDNAs for amidating enzyme. „ rana , oH fnr nxamole from the total RNA of rat 

The isolated cDNAs have been used to screen "^P^^^JK^ produC e amidating 

polymerase. The cDNA was used to generate cDNA Lbrar.es .n the vector ,gt 
DNA-s were packaged in vitro to form infectious bacteriophage 'P^les- 

Extracts for packaiinTare commercially ava.lable ^ «™ Pt ^j2l saeen96 ^ 

Laboratories or can be prepared according ^^^^ n ^X^^ containing 
ra«o.abe.^ of an d 

S Ih^orn 6 ^ with - 0^ -e radiolabe-led AE-specifc 



EP 0 382 403 A2 

i 



oligonucleotide probes conferred specificity. 

Oligonucleotide probes denoted AE4. AE5, AE8 and AE9 at pages 61-64 of EPC appl.cat.on No. 
88307533.5 and published a* Publication No. 0308067 are especially preferred when screening libraries 
prepared from biological deposit IV1 10029 identified above. hrMi , atinn 

s Analysis of the AE cDNAs from many bacteriophage isolated by the above oligonucleotide hybndization 
screening procedures indicated that the cONAs could be separated into a plurality of distinct types. The 
structure of one type is given below as diagram A. where the nucleotides have been numbered with base 1 
as the first base of the codon for the initiator methionine. Below the nucleotide sequence is given the single 
letter amino acid code for translation of the gene sequence into protein. Numbers in parentheses at the end 

io of lines indicate amino acid numbers: 



Cccr«cc«Tcc W c«cmcc^ 



20 



25 



35 
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««cA W ccTocaccecm»c«ccc« e mcKtc™ 

......... r t t r .......... . 

.cc^^cuwccAKwrnnccecKeu^ 

cTCCcTeT«cm*ee.»eccTTC CTM m«cm^ * 

r; Trr r v -rrr v: . « . • 

e«cTc*»«mcTc«TiTTOT«T.»»«mcTCT«.«m»^^^ ' 
...... ..........................J. ~ . • • » • * ' » ' ' ' » 

W ermocm««cmo.o«*o»AKTCO«cc«mcTT«TC^ > 
,ccreTceceT.Tem*«mc*«ccTeTecccc^ .« 

._. ...... . • , , . • • « 

„,T/™«T«CT«CA»TTWTCCM«K»mTOT.^ 

TTTTrrrr'nTr; *r; r« « » « • « r « 

._...-....--.-.» --.-» , .... i , i 
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«.mm.c»TcoA..ec...T.mAcim^ ,,M 

rrv rrr; v rr rri i . » . • ' « • * m 

.ecMnecTOK.A.ceco.c.TCcn^ 

.......... r -.- r; - r - r; -rr;-r;-i ...... » - < • 

r~-m* r;*-i"rT*r; : : •••••• r •• i 

■r -;' -prm tt« i . ■ • ■..««•»««» < « » • * .■• p 

T„ M AA«ACAccA«CTccKAueA«^^ * 
"J"W V'l ' "I « ■""•* •" » " » ;.•..»•» « ' » »»•»•» * 

» ACAC.KCAA.nmCCCTCACACAKT^^^ 

r;r: 7-TiTr; ui unnf « . 

TccA.ccTC«CACTc.cc.AAACCATn^ 

— . — . — . — ♦ — -.«--♦ — ---J-;-;-;-'; < 'p«tt*'>* > ' are '*' 

Ti •;•*;**; r; '. ; *;* tt; T*. "'%"• . •• . • ; . » r * • « 1 • • » » 
TriTrrrrTrrTT"; « * . « • « • » • • - - » 1 ■ * « • 

* • *""*"'^"" • I, f t ' ■ I p I I • • * ^ 1 1 ' * " ' 

caLLLIaccL^ 

— . — ♦ — . — • — -. — ♦ — ;"7 1 I* t tiiv*r»i* llBiir 

cU^a*™.!^ 

^AOTCCCAMCTCmATATCAACTTAWGtACACJ^ IMS 
CTTAm«CTCCAICTAAAtAC(XCTAnrcKCTK»ACAlA^ 

t \ .'•!■ ' . . ' . 5,14 

Another type of cDNA sequence isolated by the oligonucleotide hybridization screening set forth above 
is represented by Diagram B below where the numbering and other conventions are the same as those 
stated for Type A above: 
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CWATCCWCACTCCCACCACCTT^^ "J*J 

tcccccccccccccccacto 39 

— . — ♦ — . — « . ♦ —.4-. - . ..-.#..--.—-♦.--- ■---{---• ™;*™f T V"I ' T~ I u*» 

TCCACACCACCTCCCTCGCCTOACAAC^ 155 

ACC A CTC * CCCCTCTTC ATCCA TCi.C ATTTTCCCCTCC A TA * ^^^^^ IT* 4*^!* It ^^1^1 -1 -1 1 t^TIT-~- ♦ I^^J 

CTCCCTCTCCAT1SACCAACCCTTCCT0ATTCACTTCAACCCT 155 
^...—.♦^.—♦.— .-.^-.....•-♦..-•.-^ ■ ■ P* • • «»*» 

CCACTCaAACttACTCCTTTTCTOre^^ 455 
75 ACCTCTTWATTCACACTITCCACGACAAACTCCAAC^ 555 

7\"i"*r"rTrri i* m i" J 7 » i"5'f k ri 4 1 11 1 rVi k i < 

TgCTCTCC W IOTC Cq TACltCTCACACOtCTCCCCCACCCTmmO m 

~l . j— - • r*;*"~i"T* J~ * Y" J T ' rVTVT I /'i 

A ACT A C TO A A TCCTC AC A TTTCCTCCC A A T AC A A A A TCT A Y^CCT A A ^ «3 2 ♦^3— — 3 ™ 

ATACACACTAAOUACC^CACTCQACACTCAT^ W5 

CCmTATACTCCCACCCAOATCTCTCTm 1SS 

---^Y^r'rTTYT'rTTTTTTTYTYTT"* r*r% • ■ • » »" » 4 <m * 

TCATCTAmCATOCUCCCAAATATCCACTTTCCTTCA 1055 

~*rT"~i™*i~*"™i~"'i i' i V i i » i i 'i i i.r » l M • • " » 

CCCAATTCCTCTCAAA^A«TCOTTATCATCCA«CW 1158 
30 GACCAAOAACTATTAGACCACcWcATTTCTATTCACTGCf TTW 1255 

tass 
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ATOCOCTUTOaATOTACTCAO^ 



TCTCCTTCCACCAO 1433 
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ccTccccnwccmranccACAAccm 

CC0TCOCCCWCAnrTMCUTUCCTAOT0*TmCC4CACA0CT0A U " 

ACCTCTTCCCCCAATTCAACAACACACCATCCTCCTCATTCACCCAA 4755 

CCCTTCACCATACATACAOATCCAAATTATTCCCTCACACATCTOCCTCTC^ 1,55 

riTTT'MTrr** it rT*rwvv rv~< > * • * i • > r i » 

TmCCCAACCACCATCCAACCTCCCACTCACCAAMTCATna^ACCWACCCU^ 1,55 

T l V riTTi "™S"«""«"**T"i~« J l I • i » • t i t I ■ » « » • < 

CCCTTACTCTAACACTCCQATTTTIXCACTTTTCACCAACCCCAAA<n^ 1055 

WTTlTiTT M "VTY "J* i » I M I • • • • I « • » r M 1 

TTCACTCTTCCreACACTTTCCCCCTTCTCCCTCATTTCCACCACnrTOT 1153 

AACAATTTCTCACAQAflATTAACCACCCATCATTTCQAACOAATOTCTTTGC^ 1155 
CmCCACACCAACAtXCCOTCCAACCATmTCATCAACTTnCCM 

CATCATATTGTGCCTTCTCAACATCCCACTtnC^ 2455 

CACTCAAAAAOCCTWCArTCAACTCCAGOAAATCAAA^ 2555 

» r k* i i T*^ • i >» • 7 • » t f k f i » M • • • 1 < fc ■ 

CCAAGAGAAACACAAACTCACCACACACCCCCCCTCCCC^ *« 5 
CTCATCTTTAfTCCCTCCAAAAAATCAAGCCCCTCTCCAM 2,55 

ri r r*r " i'V ri r*i . , , , , , , , f i ; r r i a 

CACCACCAAGAACACAACCAAMCCAACACAACCACCAACAAAAW^ «»* 

• i r r "i"t *~t* r /"i 'i r fe r t • I r . <iaa> 

ACCCCTCTCACAATAACTACAAGCCCAAACW^^ "* 5 

=r,an^^^ 

assoc^ed" Tnd "^swH^" « $ ^' 9 «™ $n °» ""j 1 ** I^SiutbMW^S-Ss'S 
domains appear in the two cONA exarnp.es s« pnh rn *^ ^ B). Tbe 

^aTp^arA'^sTp^S 

S the exp^oo of «MI appears lo be an intron region torn base 1 178 Urorgh baae 1492 of lire type 
CDN T A ho oONAs o. diagrams A and B above enoode protein £££ 
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preferred that the gene sequences that encode the natural secretory signal sequence be removed, that an 
initiation codon be placed within about 50 nucleotides of the gene sequences encoding the start of the 
mature protein corresponding to alpha-amidating enzyme, and that the gene sequences encoding the 
membrane spanning domain in the C-terminal region not be translated. The initiating codon js o course .n- 
s frame with the sequence which encodes the enzyme and. .n some preferred embodiments, .s upstream 
'rom the region, sometimes immediately upstream. 

When the AE cDNA was expressed in E. Coll. it was discovered that the natural gene sequence 
contained a cryptic E. Coli ribosome binding" site ("RBS") and initiation codon internal to the natural 
initiation sequences. fhisTisulted in the production of an N-terminally truncated amidating enzyme protein. 
,o While this did not prevent the production of the desired product in E. Coli. the coexistence of the correctly 
initiated and internally initiated products complicates the processing and purification of the recombinant 
product to a useful form and is therefore undesirable. To eliminate the unexpected, undesired product, it 
was necessary to eliminate either the ribosome binding site, the internal initiation codon or both of these. 
For example, in certain preferred embodiments of the invention, a valine codon which, in prokaryotic 
is sys tems. codes an initiating methionine, is altered by a point mutation to an equivalent non-.mt.at.ng va line 
codon at bases 661-663 (of the cDNAs of either diagram A or B). In lieu of this point mutation or in add.t.on 
thereto, applicants, in other preferred embodiments, delete or substantially modify any region coding for a 
ribosome binding site which occurs just upstream of an internal initiation site, and more preferably any 
ribosome binding site whenever one may occur. These modifications are made to substantially eliminate 
20 internal initiation such that the protein expressed because of interna! initiation is not observed as a separate 
band following electrophoresis. 

To obtain expression of secreted, active alpha amidating enzyme protein, from a recombinant 
eukaryotic host cell line it was necessary to remove the gene sequences encoding the transmembrane 
domain found in the C-terminal region of the natural gene sequences. For the type A cDNA this has been 
25 done by truncation of. the protein coding region through introduction of a stop codon at or near to where the 
natural amidating enzyme is post-translationally processed in some natural systems as explained in detail 
below For the type B cDNA this has also been done by introducing a new stop codon in the region of the 
enzyme protein where the natural type B amidating enzyme is post-translationally processed (see below). 
This should not be taken to exclude the possiblity that in some host cell systems it may be preferable to 
ao express the entire naturally occuring gene sequences. Because the type B cDNA contains sequences with 
the characteristics of an unprocessed intron there may be a difficulty in expressing this cDNA in some 
eukaryotic host cells. These cells may not efficiently produce an mRNA from the type B gene due to the 
presence of the paired splice donor and acceptor sites. Elimination of the acceptor site might therefore be 
necessary to allow for efficient expression of type B AE cDNA 
35 We have discovered that the carboxyl end of the naturally occurring 75 kD alpha am.dat.ng enzyme 
protein occurs beyond amino acid position 709 (814 of type B). To produce the 75 kD protein (87 kD o 
type B) in a recombinant DNA host cell, a stop codon has been introduced into the cDNA by mutation of 
the codon for the lysine of amino acid position 716 (821 of type B). This modification has been made using 
oligonucleotide directed site specific mutagenesis. Such mutagenesis can be accomplished in a variety of 
40 ways The methods have been reviewed extensively in the molecular biology literature. The general method 
that we have used was described by Taylor. J.W. et al. (1985). Nucl. Acids Res.. 13: 8749-8764; Taylor. 
J W et al. (1985), Nucl. Acids Res.. 13: 8764-8785; Nakamaye. K. and Eckstein. F. (1986). Nucl. Acids Res.. 
14: 9679-9698. The reagents neededlo practice this method are available in the form of a mutagenesis kit 
from Amersham Corporation. ... . 

45 The mutation of the sequence that we have produced changes the AAA lysine codon to a TAA stop 
codon The oligonucleotide used for the mutagenesis incorporated this change but was otherwise identical 
in sequence to the naturally occurring cDNA sequence for the respective enzyme (type A or type B) being 

mutated. . 

We have also discovered that a naturally occurring shortened form of the alpha amidating enzyme 

so protein is produced by processing of the type B protein at the internal region of the protein that is unique to 
the type B enzyme protein. This results in an enzyme product that is approximately 43 kD in molecular 
mass. Without intending to be bound by theory, it is believed that the DNA sequence upstream from the 
intron region is sufficient to code for a polypeptide capable of exhibiting significant alpha-amidating activity. 
Accordingly polypeptides which are easy to recover and which are capable of expressing alpha-am.dafing 

55 activity may be encoded by cDNAs which are significantly truncated by placement of a stop codon 
somewhere in the intron region of type B cDNA in just before or after the corresponding location where this 
intron is missing from TYPE A cDNA. Preferred truncation results from placement of a stop codon within 
about 30 bases of the beginning of the of the intron region, preferably immediately downstream therefrom. 
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To enable the production of one preferred short form of alpha amidating enzyme protein in recombinant 
host cells a modified cDNA is created having a stop codon in place of the lysine codon at amino acid 
position 436 of the type B cDNA. This mutation was accomplished by oligonucleotide directed site specific 
mutagenesis of the type B AE cONA. , 

While the shortening of the amidating enzyme protein by introduction of the stop codon at amino acid 
oosition 436 of the type B cDNA gives a protein that most closely approximates the one produced naturaHy 
bv proteolytic cleavage of the primary translation product (or some other cleavage intermed.ates in the 
biosynthetic pathway), a further shortening of the amidating enzyme protein may also result in production of 
an active product in recombinant DNA host cells. We have modified the AE cDNA in several other ways to 
create such shorter forms of protein. In one example, we have used oligonucleotide directed site-specific 
mutagenesis to convert a tyrosine codon at amino acid position 396 of the type B cONA to a stop codon. 
This change will result in a protein that is approximately 39 kD when the cDNA is translated and processed. 
In a second case, we have utilized the naturally occuring Bam HI enzyme recognition site of the type B 
cONA to introduce a stop codon by linker mutagenesis. This method is well known in molecular biology and 
simply" involves the cleavage of the cDNA followed by ligation to a double stranded synthetic linker 
fragment that is complimentary to one end of the cleaved cDNA and that introduces an in frame stop codon 
just beyond the cleavage site. We have used an oligonucleotide fragment with the following sequence to 
accomplish this modification: 

20 5 ' GATCCACTAATGATCA 3 ' 

3 , GTGATTACTAGTTCGAg , 

25 This linker introduces a stop codon following the histidine codon at amino acid 469. Translation and 
processing of the cDNA once it has been modified in this fashion results in the synthesis of a protein of 
approximately 46 kO. 

Preferred placement of a truncating stop codon is within about 15 bases of a DNA sequence which 
codes for consecutive basic residues (usually a Lys-Lys) and especially immediately upstream therefrom 

30 Without intending to be bound by theory, it is believed that the natural polypeptide coded by the cDNAs o 
type B is processed, during post-translational modifications which occur during natural expression of 
amidating enzyme, at or near such consecutive basic residues, for example, the consecutive lysines coded 
within the intron region of the cDNA of diagram B. Even when the inserted stop codons are not intended to 
truncate the expressed polypeptide in the above-described manner, it is preferred that the inserted stop 

35 codon be placed within about 20 bases, and preferably immediately upstream from. DNA sequences coding 
for consecutive basic amino acid residues. Insertion of stop codons at these positions will likely result in 
expression of a polypeptide resembling certain natural amidating enzymes after they have undergone post- 
translational processing. 

For cytoplasmic expression in prokaryotic systems, any signal sequence coding regions (for example. 

40 the first bases of both the type A and type B cDNAs diagrammed previously) are preferably eliminated and 
a methionine initiator codon is inserted within about 50 nucleotides of the beginning of the region which 
codes for amidating enzymes. ° 

An alternative embodiment for prokaryotic expression eliminates any coding sequences for signa 
sequence or proenzyme sequence and inserts an initiator methionine codon within about 50 nucleotides of 

45 the beginning of the region which codes for amidating enzyme. In many natural AE cDNAs, this 
corresponds to the beginning of the region which encodes ser-x-ser (X being phe or leu). See. for example, 
bases 124 to 132 of the sequence for type A or type B cDNA. In some embodiments secretion of alpha 
amidating enzyme may be desirable. In this case it is preferable to retain the signal sequence coding 
regions, or alternatively to replace them with heterologous sequences that can serve the same function, for 

so example, the signal sequences of the bacterial OMP A protein. 

It will be readily apparent to those skilled in the art that numerous mutations and truncations of the DMA 
sequences set forth herein for encoding amidating enzyme are possible within the scope of the invention 
and that such modified ,quences would code for polypeptides capable of functioning as a™™^ 
enzymes. Accordingly, applicants* claims should be construed to include all functional equivalents of DNA 

55 sequences, expression vectors and host cells specifically set forth. 

Examples of prokaryotic expression vectors which may desirably be modified to include DNA se- 
quences encoding amidating enzyme in accordance with the invention include but are not limited o 
PKK233-, pKK322-2. pPROK-1. pkT279.280.287. pPL lambda. pYEJOOI. pKC30. pPROK-C. all commercially 
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available. Prokaryotic hosts which may be transfected with express.cn vectors in accordance with the 
invention include but are net iir-.ted to C600. LE392. RR1. DM, SF8. all commer.cally available. 
^SilSnSlSton vectors which may desirably be modified to include DMA sequences encoding 
^S enzT^TlccortMce with the invention include but are not limited to pMAMNeo. 
^ItnTpRSV. PeuKCt. P CH110. a., commerically available. Appropriate yeast vector, r al so 
be used Preyed euro^yotic hosts may be transfected with expression vectors .n accordance wrth the 
nventn inc delut are no" limited to IV. deposit 10029. Hela. CV1. C127. CHO (Chinese Hamster Ovary) 
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and COS. 

EXAMPLE 1 



Expression of Alpha Amldating Enzyme P roteins in E. Coli 

In order to express alpha amidating enzyme in E. Coli (see the flow chart of Figure 2). a cDNA , r agment 
having the sequence set forth in diagram A. above, was digested With Kpnl and Hmd II and the fragment of 
MtoSlu isolated. To build back a amino terminus corresponding to a natural mature enzyme, an 
oligonucleotide linker with the sequence , 

5 ' CATGTCATTTTCCAATGAATGCCTTGGTAC 3 ' 
3 , AGTAAAAGGTTACTTACGGAACg , 



was licked to this DNA fragment. The resulting fragment contained one Nco I compatible sticky-end and 
Z M M sticky end. The E. Coli expression vector P KK233-2 was obtained commercia^om Pha mac* 
and digested with restriction^"^ Nco 1 and Hind .... The large hnear fragment was plated and l^ated 

30 to the linker adapted cDNA fragment. The ligation mixture was used to transform competent E. Coh JM105^ 
Transformants were selected by ampici.lin resistance and the clones isolated were analyzed for he 
recombinant plasmid by restriction enzyme and DNA sequence analysis to confirm the str ucture of the 
expression vector (hereafter "pAE12") that they contained. The expression vector contains he hybrid trp-lac 
p o P moter whS s repressed by the lac repressor and inducib.e by treatment of the ee ls witf , .sopropyl- 

35 Lgalactoside (IPTG). Upstream from the initiator methionine the vector also contams the sequences of a 

nVo^TeTeSnlT the alpha amidating enzyme in the E. Coli. the recombinant cells were grown 
wi «h?hatng in LB-broth at 37' C to an OD 60 o of 0.4. IPTG was added to the culture to a final concentration 
o hi! and he growth was allowed to continue at 37* C with shaking for three to five hours. Cells were 
40 llected by conjugation of .he culture and the supernatant was discarded The cells were , resuspe ded m 
buffer containing a coctaii of protease inhibitors, treated with lysozyme and then sonicated to lyse he eel 
melanes The lysates were centrifuged at 12.000 x g to separate the soluble and insoluble , fractions o, 
the cells. Each fraction was analyzed by SOS-PAGE and protein staining. The ^a am.datmg en yme 
protein was readily identified as an IPTG inducible product in the insoluble protein fraction. Since rtte .n*a. 
45 expression plasmid did not contain a stop codon specified by the alpha amidating ^^""^l 
. the inducible product formed contains sequences specified by downstream vector DNA fused to Jta C- 
termina. of the alpha amidating enzyme protein sequences. In addition the '"duced ins^b protein a.so 
contained a smaller amidation enzyme specific protein that represented a produc formed by inl iemal 
initiation of protein synthesis at a cryptic RBS and initiation codon (am.no acid pos.t.on 221 of the alpha 

50 tSSS^. the Ctermina. portion of the expressed product, a muto«on of 
• the lysine codon at position 71 6 of the type A sequence was made to generate a TAA stop codon a this 
positton The mutated' cDNA was then digested with Kpn . and Eco Rl and used »^ * ^£ 
i-Eco R1 fragment in the initial expression vector pAE12. In a s,m.lar fash.on. the type f ^ N K A J e .^" C ^ 
55 were mutated at the comparable position (amino acid 821) to create a stop codon «J*« "^g 0 ^ 
fraoment from the mutated type B cDNA was used to replace the corresponding fragment in P AE12. The 
Lo expresston plaTmids so created P AE24 (type A) and pAE25 (Type B) were then used to transform 
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JM10S. The resulting strains were cultured for expression as was done previously for pAE12-contain.ng 
strains The PAE24 was found to produce two IPTG inducible, insoluble proteins of approximately 75 kD 
and 55 kD while the pAE25 was found to produce two IPTG inducible insoluble proteins of about 87 kD and 
67 kD. Again, the small protein in each of these pairs represents the unwanted amino-term.nally truncated 
product from either the type A or type B cDNA. Miti<Mnn 
To eliminate the initiation of protein synthesis at the cryptic internal nbosome binding site and .nit.at.on 
codon (amino acid position 221) the GTG start codon. (GTG can serve as an initiate r me tcodon « 
bacteria), was converted to a GTT codon that cannot initiate protein synthesis but wh.ch st.ll encodes the 
valine that is normally found at this position in alpha amidating enzyme proteins encoded by natural genes. 
Wnen the mutated region of the cDNA was substituted for the natural sequence in the «*P™aon vec l0 « 
PAE24 and PAE25, two new vectors were created. pAE31 and pAE32. Transforming E. Coh JM105 w.th 
these modified expression vectors and testing protein production from the resulting recombmant strains 
indicated that this mutagenesis was effective in eliminating the unwanted internal initiation. The IPTG 
induced product from the host cells carrying pAE31 was found to be 75 kD while that from cells 
is transformed with pAE 32 was found to be 87 kD. 

Since we have found that naturally occurring amidating enzyme from type B cDNA is post-tran- 
slationally processed to give proteins of approximately 43 kD. we have prepared a series of mutations in 
type B AE cDNA that allows expression of proteins that terminate at or near the position where the naturally 
processed enzyme ends. Two of these mutations were prepared by oligonucleotide mutagenesis while a 
third was created by adapter-linker mutation as indicated above. When cDNAs carrying these mutations 
were used to replace the corresponding segments of pAE32. transformed into JM105 and analyzed for 
protein production in experiments similar to those described above, truncated alpha amidatmg enzyme 
proteins were detected. With a mutation at amino acid position 396 of type B cDNA changmg a natura 
tyrosine codon to a stop codon (pAE36). a 39 kD enzyme protein was found while a linker mutagenesis that 
ended translation a» the histidine codon of amino acid 464 resulted in a vector. pAE51. which produced a 
recombinant alpha amidating enzyme protein of 46 kD following transformation and induction of E. col. 

JM1 AU of recombinant alpha amidating enzyme proteins produced in E Coli described above were found to 
segregate with the insoluble fraction of the cell extracts. The enzymes could be rendered soluble and act.ve 
by treatment with 8 M urea followed by rapid dilution in 50mM Tris-HC! pH7. When E. Coli JM105 carry.ng 
P AE12 was grown and induced with IPTG as described, the alpha amidating enzyme proteins were present 
at levels of at least 30 mgs per liter of bacterial culture. 

Representative samples of the induced insoluble protein produced in E. coh carry.ng AE express.on 
plasmids are shown in Figures 3 and 4. 
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Generation of mammalian expression vector pd BPV-MMTNEO-AE A 7s 

To generate a mammalian expression vector which expresses and constitutively secretes 75 kD type A 
alpha amidating enzyme from mammalian cells (see the flow chart of Figure 1). the following was 

45 P 0 ^^ jntermediat9 expression vector pdMMTNeo (commercially available from American Type Culture 
Collection) (as shown) was digested with Bgl II. The linear form was isolated and purified. 

2) The recombinant type A cDNA conlaining the full prepro sequence and an artificial stop codon i aa 
at position 2146-2148 was isolated by sequential digestion with Bgl I and Xho I. The fragment correspond- 
so ing to alpha amidating enzyme was then isolated and purified. 

3) The insert (type A alpha amidating enzyme) and vector. (pdMMTNeo) were mixed and the 
corresponding ends were made flush using the Klenow fragment of DNA polymerase I. The 5 protrud.ng 
segments were filled in with added dNTP. and the 3 protruding segments were digested back to produce a 
flush end (alternatively sequential SI nuclease and Klenow + dNTP could be utilized for produc.ng flush 

55 ends). The flush ended molecules were then ligated for 16 hours at 15 C. 

4) The ligated material was then transformed into E. Coli RRI. Recombinant clones were selected .n the 
presence of 50 ug/ml ampicillin. The orientation of the insert in the recombinant clones was verified using a 
battery of restriction enzymes. One clone which was referred to as pdMMTNeo a -AE A7S (clone 11) was 
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determined to have the type A cDNA The 

linearized vector -wm .^ated and punfted and then^ isolated and purified following B and 

hours at 37 C to remove J •***££ l J^ im SamHI fragment of BPV-1 ONA. which is approxi- 
BamHI digesuon of the vector ^PVMMTNeo . ^ 9 pdMMTNeo «AE M , vector, for 3 

"^i^nScLtewere transfected with 20 ug of pdBPV-MMTNeo aAE A „ by the standard CaPO* 

wmmmmmm 

when practicing the present invention, as defined by the following claims. 
Claims 

1 A host capable of expressing the polypeptide sequence of an alpha amidating enzyme said host 
sequence of: 
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45 3. A host capable of expressing the polypeptide sequence of an -J P ha am.dat.ng enzyme said host 
comprising an expression vector containing a transcriptional promoter followed downstream by a DNA 
sequence foreign to said host which is capable of hybridizing under stringent cond.t.ons w.th a DNA 
sequence of: 
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ttctoccmcmcc.tk.Jcctccc^^ 

cem»wTuc»<^o*T«TceM^»«««wii«!^..... — ♦ — T » r » • « 

4 A host acceding to on, preceding claim. vmerein too ONA seguenr* includes a stop codoo 

^zztt&zz* .ho o V o S ,oo o, . **rzzszzs£i 

amiLt, onU in proKaryo* cells, said vec* ^J^S^iTS ^ 
stream by a fast ONA sequence having an amtdabng • nw ™£J^2tI*£JL e „ 2ym8 to uodorgo 
suHiciontty homologous to a natural DNA taw" 'V*'!^?, ^TS SMuS'ncWing an 
hvbridisaeon «Mi said natoial sequence undec stringent cooditroos, and said Irrst sequence . rnc a 

oodon «Hh,n T 50 - rr: ^o -«S7*^, 
7 An expression vector according to claim 6. wherein me nrsi u«oa ^ 

within about 50 nuclootidos upstream trom the start of the Z°„ wme< ,p,ee S ing sequence 

domain. 
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11. An expression vector according to claim 10. which has no interna, initiation site between the 
initiating methionine codon and the stop codon. sequences coding for ribosome 
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defined in claim 2 or claim 3. nNA seauence is as defined 

^ An expression vector according to claim 14. wherein the natoral DNA seouence is as defined in 

Calm s « PS ts S«L by 119*0 said r,ra. DNA seorrenc. into an orrprearnon system solaced 
IrorrTpdBPVMMTNeo. pSV, pHSV. pMAMNeo. peuK-C, « Or .0 an, 0, 

19 A prokaryotio host comprising an expression veclor accontog to any of crams »»rc« 

n " ^rr;rcr>i » - * « - » - - - - 

Cl *" 2 , A L^SSSS,* *m * which is selected hpm M doposi, ,0,9. He,. CV,. C97. 
sequence as defined in claim 2 or claim 3. 



35 



45 



55 



21 



EP 0 382 403 A2 



Bglll 




— i mM W IP I »tkmm • « » * : 

r«c,u eingerfciwui / few::; { 



LftOU till 



ATG 

I 



BamHI 



Bgll 



TAA 
Xhol 



Type A *AE 

1SI nuclease 
Klenow+dNTP 
21 



|Klenow+ dNTP 




BamHI 



FIG. 1 



• j • • 
• • •• 



EP 0 362 403 A2 



PstI 




EcoRI 
HindUI 



DIGEST WITH 
KpnI+Hindlll 



XmnI 



PvuII 




Ncol PtH HlndlH 

|AQQA| AACAQAteCATQ rfCT OCA Q'CC'aAQ CTl' 
' mtt ata ala ala lya 



DIGEST WITH 
NcoI+HindHI 



ISOLATE 2.1kb 
KpnI/Hindlll FRAGMENT 
Ncol Kpnl 



f 



LIGATE 



NcoI/HindlH cDNA FRAGMENT 
I 



ISOLATE 4.6kb 
LINEARIZED VECTOR 



LIGATE 



PvuII 



FIG.2 




REMOVE 5* PHOSPHATES 
I 



HindUI 
EcoRI 



Hindi 



EcoRI 



Kpnl 



EP 0 382 403 A2 



^eavd® } ^ ; ||J It! 



MS 3Vd ^ + 





o 



4 A A A 

CO CO CO 0> 
? (O * CM 



9G3Vd 5 + | 
COI ft 

rt «el * | 

» 3¥d « + I I > I 

0 CM I I 

AAA A 



CO CO CO 0> 

CM 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

^^fBLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 

□ FADED TEXT OR DRAWING 

□ BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

□ LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: . 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



